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Description 



The present Invention relates to methods to create a user profile and to specify a 
suggestion for a next selection of a user, in particular to a metiiod to create a 
user profUe that comprises a Ust of word-weight pairs, a method to create an 
individual user proffle from a multi-user profile that comprises a Ust of word- 
5 weight pairs, and a method to specify a suggestion for a next selection of a user, 
which suggestion is determined on basis of suggestion results, which are 
computed of future program descriptions and a user profile. 

Several technologies are known which make an attempt to make "reasonable 
10 suggestions- for all kinds of content selection applications to users of tiiese 
applications. For such suggestion engines usually users and their preferences 
are modelled to present a most likely next content selection recommendation. 
One common way to model users and tixeir preferences is to use predefined 
stereotypes of user groups that characterize tiie users interests concerntog 
15 certain applications and/or topics. Also, the logging of user actions is widely 
used to infer certain characteristics tiierefrom. Such recommendation systems 
are often server-based and collect a lot of explicit ratings from a lot of users to 
build up profiles. Bach user is Uxen located in one of ttiese learned profiles and 
under the assumption tiiat the user will probably also like what the other users 
used to buUd up the profile liked, new suggestions of content selection 
probabilities will be made. 

A model of a user and tiie corresponding preferences is usually a list of word- 
weight pairs. i.e. a Ust of weighted keywords, according to which new possible 
selections are searched to find the best matching one. The paper "Adaptivlty 
through Unobstrusive Learning" by Ingo Schwab and Alfred Kopsa. Kflnstilche 
IntelUgenz. Volume 3-02. pages 5 - 9. ISSN 0933-1875. Arendt Tapp Verlag. 
Bremen describes an approach for learning user profiles which consists of such 
a list of weighted keywords Implicitly from positive user observations only. In 
tills paper several metiiods togetixer with user profiles are described. Further. In 
the paper "similarity measures for short queries" by Ross Witeinson. Justin 
Zobel. Ron Sacks-Davis. Department of Computer Science. RMIT. GEO Box 
2476V. Melbourne 3001. Austi^ia. October 1995. Fourth Text Retrieval 
conference (1995). 277 - 285. information retrieval techniques are described to 
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find articles and databases based on certain keywords. In particular for ad-hoc 
^queriesL-wiiich are .usually short, ofLperhaps..two.to -ten. terms. 

As mentioned above, the known systems compute suggestions based on an 
5 existing user profile and/or a users history, and it might be possible that the 
user profile can be edited directly by the user or it is computed automatically. 
As also mentioned above, another often used mechanism is to cluster groups of 
users according to their behaviour and/or selections, wherein an expUcit rating 
about the contents from these users Is necessary. It is then assumed that what. 

10 the users in the group like will most likely also be a good suggestion for the 
current user. However, all these methods have in common that they make rather 
general suggestions, resulting from the overall history of the user or the group 
he belongs to. In the paper "Program driven music radio**, by Conor Hayes, 
Pddraig Cuzmlngham, Patrick Clerkin, Marco Grimaldl, Department of Computer 

15 Science, Trinity College DubUn, Proceedings of the ECAI 2002 (5*^ conference on 
artificial Intelligence on 21. to 26. July 2002), it is described to find better 
recommendations that are pertinent to a listener's current listening preference 
in streaming audio technology and recommendation techniques, according to 
which the resulting suggestions are filtered according to the current situation. 
20 The current situation here is the users behaviour in a certain (recent) time 
window to assure adaptation of the suggestions to an eventually changed taste 
of the user. However, also in this system the suggestion might be rather general 
to fit the current situation. 

25 Such rather general suggestions that are valid for the user/user group as 
observed over a certain period of time might be inappropriate in certain 



Therefore, it is the object underlying the present invention to provide improved 
30 techniques to generate user profiles and to specif a suggestion for a next 
selection of a user. 

This object of the present invention is solved by a method to create a user profile 
that comprises a list of word-weight pairs according to Independent claim 1, a 
35 method to create an individual user profile from a multi-user profile that 
comprises a list of word-weight pairs according to Independent claim 29, and a 
method to specify a suggestion for a next selection of a user, which suggestion is 



situations. 




MOLLER . HOFFMANN & PARTNER . 3 . 

Sony International (Europe) GmbH 54.509 30«08.2002 



determined on basis of suggestion results which are computed of future program 
descriptions and their user profiQes according to independent claim 33. Preferred 
embodiments thereof are defined in the following subclaims. Further, the 
present Invention provides a computer program product according to claim 38, a 
5 computer-readable storage medium according to claim 39, a profiler to create a 
user profile that comprises a list of word-weight pairs according to claim 40, and 
a suggestion engine to specify a suggestion for a next selection of a user, which 
suggestion is determined on basis of suggestions results which are computed of 
fixture program descriptions and a user profile according to claim 41. A preferred 
10 embodiment of the suggestion engine is defined in claim 42. 

It should be noted that instead of word-weight pairs, 1. e. a list of weighted 
keywords as already explained above also more general structures such as n- 
grams (words depending on the context of the n previous words (excluding 
15 stopwords) can be used. Also phrases containing more than one word are 
possible. These structures are called "key structures*. 

Also a forgetting factor can be used that weights "old" keywords (that result from 
user selection in the past) which are d3mamically weighted lower (depending on 

20 how long ago they show the word occurred in was chosen). In this way the most 
recent selections get a higher weight and thus an adaptation to changes in the 
user's interest becomes possible. Further, the term "future program" should be 
understood as "fixture program or stored personal content". Moreover, the term 
"program description** is meant to comprise also a "description of the selection, 

25 e, g. TV program abstract**. 

The method to create a user profile that comprises a list of word-weight pairs 
according to the present invention comprises the step of computing the weights 
based on user featiures that represent a t3rpical general behaviour of an 
30 individual user in respect to the application where the user profile is used. 

The method to create an individual user profile from a multi-user profile that 
comprises a list of word-weight pairs according to the present invention 
comprises the step of at least once splitting the multi-user profile according to 
35 user features that represent a typical general behaviour of an Individual user in 
respect to the application where the user profile is used. 
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The method to specify a suggestion for a next selection of a user, which 

suggestlon-is-determlnecLoa-basis-. of - suggestion results, .which -are-^omputed-^of . 

future program descriptions and the user profile according to the present 
invention comprises the step of filtering a user history which is used to create 
5 the user profile, and/or a suggestion results based on an actual situation of the 
user represented on basis of user features that represent a typical general 
behaviour of an individual user in respect to the application where the user 
profile is used. 

10 Therewith, according to the present invention the user profile as well as the 
suggestions computed based thereon are obtained taking a special set of user 
features into account. The user features are defined to represent a typical 
general behaviour of an individual user in respect to the application where the 
user profile is used. In other words, for each application where a user profile is 

15 used a special set of user features are defined which are able to represent a 
typical general behaviour of an individual user. Based on these user features the 
weights in the list of word-weight pairs or weighted keywords are computed or 
influenced during the creation of the user profile, a multi-user profile is split 
during the creation of an individual user profile from a multi-user profile, or 

20 during specification of a suggestion a user history which is used to create the 
user profile, and/or the user profile, and/ or the suggestion results are filtered. 

The user features are chosen to be able to represent a typical general behaviour 
of an individual user in respect to the particular application, i.e* to be able to 

25 characterize predetermined schemes or rules according to which the user 
usually makes his selections. By initially considering the selection history like in 
prior art systems it is possible that the methods according to the present 
invention more and more adapt to the behaviour of an individual user, but not 
mainly based on finding characteristics in the selections of the user, but rather 

30 by improving a simulation of the user's interest. 

Depending on the application, different features might be helpful/necessary to 
infer from users action what he actually likes. These features are different fi-om 
recent "state-of-the-arf*. By choosing these particular features a more complex 
35 reasoning can be conducted that helps to compute user preferences on a deeper 
level. 
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Furthermore, the features in the history are used to compute dynamically a 
specific profile based on an, actual ,user request. Assume for example the user, 
watches different movies on working days (maybe shorter ones and not too 
exciting) as on weekends. Furthermore, on weekends his watching behaviour in 
5 the afternoon is different from the night. So if on a certain Saturday night the 
user requests a movie all relevant entries are selected from the history and out 
of these a very specific profile as the basis for the current suggestion is 
computed. 

10 Alternatively, the profile computed from the general history can be used to make 
an initial suggestion of a number of items (e. g. TV shows). Then this can be 
further filtered based on the features in the history. 

The method to create a user profile according to the present invention preferably 
15 comprises the step of evaluating an Influence of the user features firom a user 



Alternatively or additionally, the method to create a user profile according to the 
present invention preferably comprises the step of dividing the user profile into 
20 sub user profiles according to characteristics of stereotype user profiles to 
obtain more specialized profiles. 

Further alternatively or additionally, the method to create a user profile 
according to the present invention preferably comprises the step of determining 
25 words to be Included in the Ust of word-weight pairs on basis of words included 
in program descriptions of previous selections by the user. 

In this case, the method to create a user profile according to the present 
invention preferably comprises the step of determining further words to be 
30 included in the list of word-weight pairs on basis of a database evaluation of co-- 
occurrences of words already included in the list. 

Alternatively or additionally in this case» the method to create a user profile 
according to the present invention preferably comprises the step of computing a 
35 weight of a word to be Included in the list of word-weight pairs on basis of a 
comparison of an inverse document frequency of said word with respect to all 



history. 
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available program descriptions and an Inverse document frequency of said word 
.- jsvlth .respect to .program jdescriptionsu^of-prevlous-selectlons. by the usee* 

Further alternatively or additionally in this case, the method to create a user 
5 profile according to the present Invention preferably comprises the step of 
computing weights of words to be included in the list of word-weight pairs as a 
product of 

• importance of the word with respect to all program descriptions of 
previous selections by the user, 

10 and at least one of the following terms: 

• a constant term, in particular one, 

• Importance of the word with respect to the program description in which 
the word is included, 

• Importance of the word with respect to all available program descriptions, 
16 and 

• a correctional factor that depends on the length of the program 
description in which the word is included and the frequency of the word 
in this program description. 



20 In the method to create a user profile according to the present invention, the 
importance of a word with respect to all program descriptions of previous 
selections by the user is preferably determined on basis of the portion of the 
word in the set of all words of the program descriptions of previous selections by 
the user. 

25 

In the method to create a user profile according to the present invention, the 
importance of a word with respect to all program descriptions of previous 
selections by the user is further preferably determined under consideration of 
the program description in which the word is included. 

30 

In the method to create a user profile according to the present invention, the 
importance of the word with respect to the program description in which the 
word is included is preferably determined on basis of occurrences of the word in 
said program description with respect to occurrences of the word in a wider 
35 database. 
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In the method to create a user profile according to the present invention, the 
importance, of. the word with respect to all available, program descriptions Is 
preferably determined on basis of its Inverse document frequency In respect to 
all available program descriptions. 

5 

In the method to create a user profile according to the present Invention, the 
Importance of a word with respect to all available program descriptions Is 
further preferably determined under consideration of the program description in 
which the word is included. 

10 

In the method to create a user profile according to the present invention, the 
step of computing weights of words to be included in the list of word-weight 
pairs might be performed according to the following formula: 

15 M;,(0 = (l + ^log(-^ + l))-aaog(^^^ 

wherein Wi(t) is the weight of a word t in a program description 1, a is the 
importance of the word t in the all program descriptions of previous selections 
by the user, t is the word. f|. t is the number of occurrences of the word t in the 
20 program description i, fi is the document length of the program description 1, ft is 
the number of program descriptions containing the word t, and N is the number 
of program descriptions in the database. 

In the method to create a user profile according to the present invention, the 
25 step of computing weights preferably comprises a step of normalizing a 
computed weight of a word included in the list of word-^weight pairs by a 
behaviour of the user to stay with a selection and to switch selections off, 1. e. a 
zapping behaviour. Also, to turn on Just for this selection and to switch off 
exactly at the end plays an important role. 

30 

In the method to create a user profile according to the present invention, the 
step of computing weights alternatively or additionally preferably comprises a 
step of normalizing a weight of a word included in the list of word-weight pairs 
by a behaviour of the user when to actually consume a recorded selection. 

35 

In the method to create a user profile according to the present invention, the 
step of computing weights further alternatively or additionally preferably 
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comprises a step of averaging the weights of a word over all program 

^d^scjiipJ;lojti§ of preyiaus. J5ele.ctions.„.by..the-..user Jn jwJhlch . the .word-Occurs. These ^. 

are the weights w(t) computed as the average out of the weights Wj(t). 

5 In the method to create a user profile according to the present Invention, the 
step of computing weights still further alternatively or additionally preferably 
comprises a step of excluding words from the list of word-weight pairs which 
show a weight below a predetermined threshold or within a predetennined lower 
range of weights. 

10 

In the method to create a user profile according to the present invention, the 
step of computing weights still further alternatively or additionally preferably 
comprises a step of computing a weight of a word to be included in the list of 
word-weight pairs by considering negative selections of the user. 

15 

In the method to create a user profile according to the present invention, the 
step of computing weights still further alternatively or additionally preferably 
comprises a step of computing a weight of a word to be included in the list of 
word-weight pairs by considering a maximum frequency of possible user 
20 selections in respect to certain user features. 

In the method to create a user profile according to the present invention, the 
created user profile is preferably used as a query in possible future program 
descriptions to suggest at least one possible future selection to the user. 

25 

In this case, in the method to create a user profile according to the present 
invention preferably for each possible future program description a modified 
OKAPI weight is computed, wherein each matching word has its weight as a co- 
factor so that the search result is influenced according to the user profile. 

30 

In this case, in the method to create a user profile according to the present 
invention further preferably said modified OKAPI weight is computed according 
to the following formula: 

35 OKA^(q,i)^ 2 ^(^).iog(j^).( 

ft + -ifi fo^Si^fi ) 
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wherein w(t) is the weight of a word t over all program descriptions of previous 
. selections- by the. user, q is a query built according to the user profile, 1 are all 
program descriptions of possible future selections. w(t) is the weight of the word 
t, namely the average of Wi(t), fj, t is the ntunber of occurrences of the word t in 
5 the program description 1, fi is the document length of the program description i, 
ft is the number of program descriptions containing the word t, and N is the 
number of program descriptions in the database. 

In this case, in the method to create a user profile according to the present 
10 invention alternatively further preferably said modified OKAPI weight is 
computed according to the following formula: 

OKA^^iqA)^ X MO (l+^log(-^ + l)). 

15 wherein w(t) is the weight of a word t over all program descriptions of previous 
selections by the user, q is a query built according to the user profile, 1 are all 
program descriptions of possible future selections, w(t) is the weight of the word 
t, f|. t Is the number of occurrences of the word t in the program description i, 
and fx is the document length of the program description i. 

20 

In this case, in the method to create a user profile according to the present 
invention alternatively or additionally further preferably the query is built by 
combining a stereotype user profile and the created user profile, wherein 
initially, when providing the suggestion for the first time, the stereotype user 
25 profile is user alone, within a first predetermined period during the collection of 
data for computing the created user profile a linear combination of both user 
profiles is used, and after said first predetermined period during the collection of 
data for computing the created user profile the created user profile is used. 

30 In the method to create a user profile according to the present invention the 
stereotype user profile might comprise a generic stereotype profile, which 
describes an average interest in everything, and at least one specific stereotype 
profile each defining interests of a user who focuses on a specific topic. 

35 In the method to create a user profile according to the present invention 
preferably a weight with which a specific stereotype user profile is considered 
during the linear combination to provide said suggestion is computed by 
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matching the user selections with the data in the respective specific stereotype 
'and-McYeaslng the-weighti^f t stereotsrpe if-aTnatch is-found^ 

Also, Initially several stereotype profiles for different user groups are used« After 
5 a few observation of the behaviour or a particular user a more specific 
stereotype Is computed as linear combinations of the initial stereotypes. 

In the method to create a user profile according to the present Invention 
preferably after a second predetermined period the query is built by an 
10 Individual user profile created as defined and described below. 

In the method to create a user profile according to the present invention 
preferably the query Is built under consideration of a filtered user profile as 
defined and described below. 



In the method to create an Individual user profile from a multi-user profile 
according to the present invention preferably for splitting the multi-user profile 
the following steps are performed: 

a) performing a tentative split according to user featiu-es to generate a first 



b) calculating the relative difference between the two sub user profiles, 

c) performing steps a) and b) until all or a predetermined number of 
tentative splits are performed, and 

d) splitting the multi-user profile according to that tentative split that yields 



predetermined threshold. 

In the method to create an individual user profile from a multi-user profile 
according to the present invention during said spUtting preferably said relative 
30 difference is calculated by calculating a difference of a first discrete probability 
distribution of the flirst sub user profile over the user features that are contained 
therein and of a second discrete probability distribution of the second sub user 
profile over the user features that are contained therein. 



15 



20 



and a second sub user profiles. 



25 



the highest relative difference in case said relative difference lies above a 



35 



In the method to create an individual user profile from a multi-user profile 
according to the present invention during said sphtting further preferably said 
-difference of said two discrete probabiUty distributions is calculated using the 
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symmetrized KuUback-Lelbler-distance sum» where events which happen zero 
times are -replaced by one virtual occurrence. - 

Each of the above described methods according to the present invention Is 
5 preferably used In an audio/video program suggestion engine. 

In each of the above described methods according to the present invention 
further preferably said audio/video program suggestion engine is Internet based. 

10 Bach of the above described methods according to the present invention is 
preferably client based, 1, e. it is Implemented on a consumer device like IV, set- 
top-box, VCR. home server, rather than on an Internet server or similar public 
server. 

15 In each of the above described methods according to the present invention 
further preferably said user features comprise one or more of the following 
features: 

preferred channel of audio/video program consumed by the user, 
typical time to consume an audio/video program by the user, 
length of consuming an audio /video program by the user in relation to 
the total length of the audio/ video program, 

time of beginning the consume of an audio /video program by the user in 
relation to a start time of the audio/video program, 

typical length of consuming an audio video program by the user in 
relation to the time of consuming. 

relation between how often a particular audio/video program is 
consumable and how often it Is consumed by the user, 

general audio/video program consuming behaviour of the user, in 
particular in relation to a switch-on time and length of a used 
audio/video device, 

audio/video programs recorded by the user, 

time duration between the recording of a particular audio/video program 
by the user and the consuming of said audio/video program by the user, 
actual mood of the user, 

actual wish of audio/video program entered by the user, 
year of production of an audio/video program consumed by the user. 



20 



25 



30 



35 
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• director and/or actor and/or group of actors of an audio/video program 
consumed by the-user; - - — -r 

• type of an audio/ video program consumed by the user, and. 

• title of an audio video program consumed by the user. 

5 

The computer program product according to the present invention comprises 
computer program means adapted to perform the method steps as described and 
defined above when being executed on a computer, digital signal processor, or 
the Uke. 

10 

The computer readable storage medium according to the present Invention 
stores thereon a computer program product as described and defined above. 

The profiler to create a user profile that comprises a Ust of word-weight pairs 
15 according to the present invention is adapted to perform the method steps as 
described and defined above. 

The suggestion engine to specify a suggestion for a next selection of a user, 
which suggestion is determined on basis of suggestion results which are 
20 computed of future program descriptions and a user profile according to the 
present invention is adapted to perform the method steps to specify a suggestion 
for a next selection of a user as described and defined above. 

The suggestion engine according to the present invention preferably comprises a 
25 profiler according to the present Invention. 

In the following the present invention will be described by way of an exemplary 
embodiment thereof taken in conjunction with the accompanying figures, 
wherein 

30 

Fig. 1 shows the arrangement of a suggestion engine for a home 

server user interface. 

Fig. 2 shows the arrangement of a profiler shown in Fig. 1, 

35 

Pig. 3a to 3c show a user profile generation according to a first aspect of the 

present invention and its application to make a suggestion, - - - 
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Fig. 4a and 4b show the adapthig and spUtting of user profiles according to a 
second aspect of the . present invention, and - - 

Fig. 5a to So show three possibilities to interactively influence a suggestion 
5 engine according to a third aspect of the present invention. 

In the following, the present invention is exemplary described in a context of a 
system that gives recommendations for TV programs for a set of users in a home 
environment. The present invention is In particular applicable to such a system, 
since today's users of TV programs are faced with a vast variety of channels and 
shows, between which they have to chose what they want to watch or record. 
Electronic program guides (EPGs) that are available on the internet, on 
information streams accompanying TV broadcast such as Teletext or generally in 
DVB and on set top boxes help and assist the user in making a decision which 
show to watch, however, the number of channels and shows remains too large* 
Therefore, an engine that makes explicit suggestions to the user that fit his 
tastes and habits according to the present invention is particular suited to 
determine suggestions tailored to the particular user in the particular situation. 
Of course, the present invention which is in fact particularly applicable to such 
a TV program recommendation system is not limited thereto, but might also be 
used for recommending other audio/video picture, i.e. multimedia, contents or 
might help to find holiday targets, hotels, travelling routes, books, etc., i.e. 
might assist a user in any situation where a decision has to be made. 

25 Fig. 1 shows the arrangement of a suggestion engine 1 within a home server 
user interface, wherein the suggestion engine 1 is used to provide suggested 
electronic program guide articles, which are retrieved from an electronic 
program guide articles database 4. The suggestion engine 1 comprises a retrieval 
unit 3, which Is connected to the electronic program guide articles database 4 

30 and which provides the suggested electronic program guides article(s) on a 
multimodal user interface 5 including graphics, speech, haptics. touch sense, 
and an animatel life-like character and any subset out of these. The suggestion 
engine 1 comprises additionally a profiler 2. which is connected to the retrieval 
unit 3. The profiler 2 builds a user profile and supplies this as a query to the 

35 retrieval unit 3. which retrieves the articles from the EPG articles database 4 
that are best matching to the query. These best matching articles are then 
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supplied as suggested EPG axUcles to the user via a graphical user Interface 



Fig. 1 also shows an exemplary EPG article 4a of the EPG articles database 4 
5 which Is referring to a television show broadcasted on the date 18.7.2002 with a 
start time of 22:00 h and an end time of 22:45 h on the station "channel 5" with 
the title **extreme sports". Further, the EPG article 4a supplies the description 
The Italian volcanic island of Vulcano provides the gruelling backdrop for some 
tough sports. Including hill-running, kayaking and moimtain-biking** for said 
10 television show. Also, Fig. 1 shows an exemplary query 2a generated as user 
profile by the profiler 2. The query is a set of weighted words, i.e. word-weight 
pairs and includes sports: 2.12, outdoor: 2.04, kayaking: 1.90, tennis: 1.39^ 
adventure: 1.21, western: 0.83, news: 0.78. and gun 0.50. A query might, 
comprise further word-weight pairs, which are not shown. 



Fig. 2 shows the structure used to build such user profUes or . queries as a list of 
word-weight pairs. The profiler 2 has access to a user behaviour database 6 
which comprises EPG articles of the programs the user has watched and to a 
database 7 to retrieve additional information, as e.g. described further below. On 
20 basis of these two information sources the profiler 2 generates a user profile 8, 
which produces queries that e. g. might look like the query 2a shown in Fig. 1. 

The improved way of calculating the weights for the kejrwrords and the 
suggestions resulting therefrom, what also improves the suggestions, according 

25 to the present invention is described in connections with Figs. 3a to 3c. For this 
calculation a special set of user features are taken into account that are also 
used to compute the suggestions according to the present invention. In the 
described exemplary embodiment it is the aim of the invention to be able to 
make a specific suggestion to a user who wants to watch TV. Hie mechanism is 

30 the determination of relevant kejrwords from EPG articles, i.e. program 
descriptions, here respective descriptions of the relevant data and abstracts of 
TV shows, a weighting scheme for the determined relevant keywords, and a 
search, which identifies relevant EPG articles of future TV shows as suggestions. 



15 



35 



In this embodiment the user features that represent a typical general behaviour 
of an individual user might be: 

• preferred channel, ... 
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• typical time (day of week» time of day, weekend, bank holiday, ..•) to 
- watch TV, ^ . - . . - - _ - 

• zapping behaviour, e-g- if a show was watched for a very short time only 
that probably means that the user did not like it, whereas when the user 

5 explicitly switches to a channel at the beginning of a show and watches It 

for the whole time and then switches oft or switches to another channel, 
the user probably likes the show; A weight z[i) can reflect how much the 
user likes a specific show or program with respect to the zapping 
behaviour, 

10 length of the show, e.g. it could make a difference when a movie takes two 

hourst whether the user wants to watch It in the early or late evening, 

• relation between how often a series is broadcasted and how often it is 
watched, 

• time the TV is switched on, i*e. whether it is Just running all day or it is 
15 specifically used for particular TV shows; in the first case the selected 

program might have a lower weight, 

• what was recorded, 

• how long did it take until something that was recorded was watched, 

• how often was a recorded program watched, e-g. was It watched at all; a 
20 corresponding weight r(i) might reflect how much the user likes a specific 

show with respect to these questions, 

• also the mood of the user can be taken into account, 

• year of movie production, 

• director, actor, pair or group of actors, 
25 • title, 

• type, 

• an actual wish of a program entered by the user. 



The typical behaviour, e.g. zapping to a few specific channels, of the user after 
30 switching on the TV can be learned from the past and an immediate suggestion 
after switching on the TV can be made. Since the user features that represent a 
typical general behaviour of the user are influenced from the history and not 
only specific articles of the history are used to generate the suggestions, it is 
according to the present invention possible that something new will be 
35 suggested. 
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Further explicitly surprise suggestions can be made that do not fit the profile on 

^purpose- These- surprises-are-either-chosen-randomly- or near-matches of -the- 

user or profile or dehberately contrary to the user profile or they are obtained 
from an Internet server based on stereotype user profiles or they are included 
5 directly in the EPG. 

In this context the clustering within a profile is also possible, i,e. instead of 
keeping specific keywords to build a profile, a profile could also be composed of 
weighted* very specific profiles, e. g. a profile for classical music, one for 

10 extreme sports, etc. In this way it can be ensured that the suggestion engine can 
generalize data, using the information in the specific profiles. B«g. if a user Ukes 
to Usten to Beethoven, he/she might also like Mozart. This cannot be learned 
from the history if nothing with Mozart was ever selected, but in the specific 
profile of classical music, the correlation between Mozart and Beethoven could 

15 be modelled so that in suggestions according to the present invention also 
Mozart in addition to Beethoven could occur. Such a modelling might be based 
on co-occurrence of words in either the EPG data or in other textual resources, 
e.g. news groups or other articles discussing music, movies or TV programs, can 
be used to extend the set of used kejnvords in the profiles. Also homonyms and 

20 thesaurus could be used. These and other possibilities to specifically look for 
the extension of keywords of a user profile might be included in the database 7 
shown in Fig. 2 to which the profiler 2 has an access. A further example might 
be that supposed James Bond and Michelle Pfeiffer co-occur in such texts, if the 
user has watched James Bond a few times recently, then both James Bond and 

25 Michelle Pfeiffer should be added to his profile. Also, fl-om an Internet server 
such co-occurrence lists can be downloaded in regular intervals. These can be 
computed based on behaviour of several larger user groups in context of the 
recent TV program. 

30 According to the described preferred embodiment of the present invention the 
computation of the user profile using the above-mentioned features and the 
importance of the kejnvords contained in selected EPG items might be computed 
as follows: 

35 The user history consists of the selected EPG articles, that means the EPG 
articles that correspond to the programs the user has watched, together with the 
user features as described above. Fig. 3a shows that in a first step SI the 
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histozy consisting of a set of articles is read. In the examples a first article 
comprises the keywords pop. music, rock. Madonna, music, a second article - 
comprises the keywords Beethoven, symphony, Mozart, classic, a third article 
comprises the keywords: punk, rock, music and a fourth article comprises the 
5 kesrwords: classic, philharmonic, music. 

As described further below, based on a specific user request, e.g. **movle on the 
weekend", the user history can first be filtered such that only EPG articles 
remain that fulfil this request. If a user request Is e.g. -movie on the weekend", 

10 it is first determined if there is a significant difference in the watching behaviour 
between the weekend and working days, before a filtering takes place. A 
significant difference can be determined with respect to e.g. length, type of the 
movie, e.g. action thriller versus love story, channel or other user features. Of 
course, as also further elucidated below, the full user history might be 

15 determined and a filtering according to a specific user request might be 
performed in a later stage. Further alternatively, not the whole history, but only 
the history corresponding to filtered articles might be used to determine the user 
history. 

20 In a following step S2 (does not appear tn Fig. 3a-c} for each word, also referred 
to as term, that is present in at least one of the articles contained in either the 
full or the filtered user history, a weight is computed with respect to each 
article. This weight according to the present invention reflects the particular 
Interest or lack of interest of the word for the user. The weight might reflect the 

25 relation of the user profile to the "normal" frequency of occurrence. The words 
for which the weights are computed might be all words that are present In the 
articles or a reduced set of words, e.g. obtained by filtering with a stop word list 
to e.g. not include articles or other unimportant words. 

30 The weight of a word with respect to an article is preferably the product of the 
importance of the word with respect to this article times the importance of the 
word with respect to all selected articles, i.e. the history, times the importance 
of the word with respect to the complete database times a correctional factor 
that depends on the length of the article and the frequency of the word. An 

35 example is: 



Wi(t) = importance (i, t) * importance (S, t) * importance (D, t) ♦ corr (i, t), 
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where t is the word, 1 the, article. S the selection, i.e. all articles of the user 

histoiy... . and D .the database-. JEhe . mentioned— importance -functions-^ for-^a 

word/article» word/ selection, word/database are usually different. For example, 
the importance of a word relaUve to an article can be the number of its 
S occurrences in the article, the importance of a word relative to the selection, i*e. 
the histoiy, can be the portion of the word in the set of all words in the 
selection, and the importance of a word relative to the database can be its 
Inverse document firequency. As mentioned, these three functions are only 
examples. Different functions might be chosen. 

10 

The function's importance (S, t) and importance (D, t) might also depend on the 
article, i.e. i. For importance (S, t) it is also possible that only a constant factor 
is chosen. Apart firom the importance in respect to the selection, all other factors 
are preferred, but optional, in other words, the usage of importance (i. t),. 

15 importance (D, t), and corr (1, t) for the computation of the weight w(i, t) at all is 
optional. The importance with respect to the complete database which is also 
dependent on the article, i.e. importance (D. i, t), often yields either O, if the 
word t occurs in most documents of the database and therefore is not very 
specific, or yields 1, if the word is very specific only for certain articles. A 

20 continuum of values between O and 1 is possible, reflecting if the word t is more 
or less specific. A concrete example for the above function is: 

w,(0 = (l + |log(-^ + l)).a4og(^^ 

25 where a is the importance of the term t in this selection, t is the term, i.e. the 
word, f,^ t is the number of occurrences of the word t in the article i, f, is the 
document length of the article 1, ft is the number of articles containing the word 
t. and N is the number of articles in the database. 

30 The weight w(i, t) might be normalized by the weights r(i) and z(i), i.e. by the 
weight which reflects how much the user likes a specific show with respect to 
the questions whether a recorded show was watched at all, how often was it 
watched, and how long did it take until something that was recorded was 
watched, and the weight which reflects how much the user likes a specific show 

35 or program with respect to the zapping behaviour. 

If a word t does not occur in a specific article 1, the weight Wi(t) = 0. 
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In Fig. 3a the weights for the words music, classic, rock, and pop in the first to 
fourth articles are exemplary calculated in that the word music has a weight L.2 
* 1 ♦! * 0.9 = 1.08 in the first article. O in the second article. 1 * 1 * 1 * 0.8 = 0.8 
in the third article, and 1*1* I* 0.8 = 0.8 in the fourth article, the word 
5 classic has a weight 0 in the first article. 1*1*1* 0.4 = 0,4 in the second 
article. 0 in the third article, and 1 * 1 * 1 * 0.7 = 0.7 in the fourth article, the 
word rock has a weight l*l*l*0,7s=0.71n the first article. O in the second 
article. 1 * 1 * 1 * 0.8 * 0.8 in the third article, and 0 in the fourth article, and 
the word pop has a weight 1 ♦ 1 • 1 ♦ 0.5 = 0.5 in the first article. O in the 
10 second article, O in the third article, and O in the fourth article. 

In a following third step S3 the weights Wi(t) of a single word t in respect to one 
article is averaged over all articles i of the selection to obtain the final weight 
w(t) for this word« In the example shown In Fig. 3a for the word music the final 
weight (1.08 + 0.0 + 0.8 + 0.8)/4 = 0.67 is obtained, i.e. w(music) = 0.67. for the 
word classic the final weight (O.O + 0.4 + 0.0 + 0.7)/4 » 0.275 is obtained. i.e. 
w(classic) = 0.275. for the word rock the final weight w(t) is calculated to (0.7 + 
0.0 + 0.8 + 0.0)/4 = 0.375 = w(rock), and for the word pop the final weight is 
calculated to (0.5 + 0.0 + 0.0 + 0.0)/4 = 0.125 = w(pop). 

Words that might be added to the user profile based on the above-referenced co- 
occurrence in databases, e.g. text databases, might receive a heuristic weight. 
For example, if the user has watched James Bond movies and this keyword co- 
occurs frequently with Michelle Pfeiffer, then w(Michelle Pfeiffer) will be a certain 
fraction of w(Sean Coimery), if Michelle Pfeiffer never /rarely occurs without 
Sean Connery. 

In a following fourth step S4. which is shown in Fig. 3b. the user profile is 
determined as the set of all terms of the history with the corresponding weights 
30 so that the user profile then consists of a set of word-weight pairs. It can 
contain all words and their weights or only a subset of the words with the 
highest weights might be taken in order to keep the computational overhead 
small. 

35 In Fig. 3b the user profile shows the entries: music: 0.67, rock: 0.375, classic: 
0.275. and pop: 0.125 as calculated in step S3. 
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For simplldly the following function W|(t) that determines the weight of a word t 

Jn^an_article was .use4_for..the_ determination^of-thejuser history -to-the-concrete. 

example shown in Figs« 3a and 3b: 

5 w,(o=(i+|iog(-^+i))-M ( J'^; ^ ), 

2 fu^^|fif^8(^|fi) 

The user profile as generated according to the above description Is then 
considered as a query q which Is used for the search in the database for KPG 
articles corresponding to possible future selections of the user, i.e. to the 

10 possible suggestions made according to the present invention. Generally, any 
available information retrieval algorithm might be used as long as the weight of 
the words in the user profile can be considered as a co-factor to influence the 
search result. In the following, an exemplary embodiment of a modified OKAPI 
weight is described » which OKAPI weight Is computed for each EPG article 

is relevant to fulfil the user's request, e-g- "movie on the weekend". During the 
weight computation each matching word has its weight as a co-factor so that the 
search result is influenced according to the information of the user profile, that 
is the preferences of the user, so that: 

t4^iei ft fut ^4fi ^^gCyJfi^ 

where the articles 1 are now from a complete future database of £PG articles and 
in addition all recorded TV shows. The modification of the formula to determine 
the OKAPI weight (as it is e* g. described in the above paper of Wilkinson et al.) 
25 lies in the insertion of the weight w(t), which is the average of all Wi(t), according 
to the present invention. With this formula those EPG articles get a high weight 
that contain words of the user profile that have a high weight, i.e. that have a 
high importance. 

30 Another possibility to formulate a modified OKAPI weight among other possible 
solutions could be: 

OKA^iq^i)= X v.,(0.(l+llog<-^+l)) 
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Therefore, in a following step S5, which is shown in Fig. 3c, for each article in 
the database corresponding to~ current, future or recorded programs^ i.e.- 
programs that will be available to the user, in particular in a range of time 
specified by the user, e.g. as a pre-setting: "now**, or "on the weekend", the 
5 modified OKAPI weight with the user profile is computed. Exemplary, an i. 
article comprising the words: "classic music" would yield in an modified OKAPI 
weight 0.945, a j. article which comprises the words -rock music" would yield a 
modified OKAPI weight 1.045, and a k. article which comprises the words 
"brltish music" would yield the modified OKAPI weight 0.67. For reasons of 
10 simplicity the following formula: 



was used to compute these values, where q is the profile and i the article of the 
15 database. 

Alternatively or in addition, the following scheme for the determination of 
relevant keywords and their weighting can be applied which determines relevant 
and Irrelevant keywords. This determination is based on the known 

20 determination of the importance of a term (or keyword) using a (function ofi 
inverse document frequency Idf, wherein idf is defined as the munber of 
documents in the whole database divided by the number of documents which 
contain the term In question. Usually the logarithm £X>G 0 of this value is taken 
and referred to as idf. If a term has a high idf, this basically means that it is 

25 injBrequent in the doctunent collection and probably very indicative in its 
meaning for the few documents that contain it. 

According to the present invention the collection of the BPG data of the shows 
that the user has watched so far can be seen as a separate document collection 

30 and idf statistics might be computed for all terms contained In this user 
database. Then, the two idf statistics, i.e. the one computed according to the 
known technique and the one based on the EPG data of the shows that the user 
has watched so far, are compared to each other. If a word has a similar 
relevance in both document collections, this indicates that this word does not 

35 discriminate very well between the "average" TV watcher, as characterized by the 
entire EPG database, and the specific user, and therefore should not be given 
too much weight. As an example, the user happens to watch, say, news Just as 
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often as the average user does, and this is not. particularly relevant for the user 



. _ ._prafile.. JCherjefore.^suchJerms-are given aUower .weight.^ although they- might-be 

rare and would otherwise be given a high weight. On the other hand, if a word 
occurs with a lower idf weighty i. e. a higher document frequency, in the user 
5 specific database as compared to the global EPG database, it can be concluded 
that the user is definitely deviating from the average user with respect to this 
very kesnvord, and would probably like to watch more programs corresponding to 
this keyword in the future. Therefore, the word weight will be increased in such 
cases. If a given word is selected significantly fewer times than expected by the 

10 user, an -anti-keyword" property can be assigned, 1. e. corresponding to 
programs that the user wants to see less frequent than the average, which 
therefore should be avoided in recommendations. The so-determined word 
weight can and should be used together with the standard idf weight as 
computed from the main database. By the combination, the highest wel^t is. 

15 given to words that are infrequent in the main database but frequent in the user 
BPO data. 

As already stated above, descriptions from DVD, databases, internet, etc. can be 
used to enhance the information in the user history. Also, negative examples can 

20 be used to build up a negative profile that means shows or programs containing 
a special set of words that should not be selected. Furthermore, a function could 
be provided that enables the user to indicate "don*t suggest this show again". It 
might be annoying to get a suggestion for a show that matches the user's profile, 
but that is actually disliked for some other reasons. Also, the EPG articles itself 

25 can be examined whether they contain some negation words like "not" in front of 
keywords. 

It is also possible to consider whether the user watches specific TV shows, e.g. a 
series or movies by a certain director or with a certain (group of) actors, etc. 
30 regularly or (almost) ever, to be suggested regardless of the actual value of the 
OKAPI weight. 

Further, to provide an even more user specific profile that will result in even 
more appropriate suggestions or this user, it is possible to split the user 
35 histories according to the following categories as it is described in more detail 
further below: 
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Maintain different histories for different members of the family. E.g. a child will 
most probably like a different type of TV show than the mother or father does.^ 
Computing a profile from all shows that were watched on one television, 
irrespective of who watched it^ will result in very '^diffuse- suggestions that are 
5 not appropriate for e.g. the father. Using different profiles can ensure to make 
appropriate suggestions depending on the person asking for suggestions. Who is 
actually watching TV could be determined by a login procedure, a simple 
selection from a user Ust, or in case this is not wanted by speaker identification 
techniques in case the device is equipped with a speech interface. Also a camera. 
10 a fingerprint sensor or some other biometric information could be used to 
identify the user. 

Different histories and/or profiles could be maintained depending on the time of 
the day or the day of the week, or any other user feature according to the 

15 present invention. E.g. depending on the day or the time different kinds of shows 
might be watched* This should be captured in the pro£Qe so as to make 
appropriate suggestions when the user requests them for a particular time. In 
case there is none of the above-mentioned possibilities to Identify the user, this 
could alleviate the problem of identification by assuming the different members 

20 of the family have different "typical TV watching hours**. It is to be noted that 
this might be an alternative application of the user history filtering mentioned 
above and further elucidated below. 

If a user request concerned a TV show in the future, e.g. "movie tomorrow 
25 evening" then all movies shown from now until tomorrow evening should be 
considered and, if necessary, recorded. 

Since according to the above-described aspect of the present invention the 
weights of the aspects in the user profile, keywords, time, specific profiles, give a 
30 notion of their relative importance, it is possible to create reasonable user 
profiles on the basis of only a few samples of the user history and also on the 
basis of a large user history. This enables the system to capture the user's long- 
term interests, temporary interests and gradually and abrupt changes of these 
interests. 



In the following a second aspect of the present invention is described in 
connection with an example for the adaptation of a user profile for a TV program 



35 
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suggestion system in reference to Figs- 4a and 4b. In the context of a system 

that „giv.es,.Tecommendations^ -e.g._for_I!V:_programs.>for .a.set . ofLusers, ^e.g.-.in,a^ 

home environment, preferably an evolution of the user profiles is performed- 
Thls might be done, since the user habits are unknown when the system is first 
5 used and should evolve from the state of initial ignorance to a fine-grained user 
profile when much user reference data is available. Further, if there is no 
explicit way of identifying how many users use the recommendation system, the 
automatically generated user profile will be a mixture of the individual profiles 
of all the users. Therefore, according to the present Invention, if sufficient data 
10 is available, a separating of the mixed profile into the underlying individual sub- 
profiles is performed* 

In such a system usually the initial user profile, when using the system for the 
first time, is a generic **stereotype" profile, also Just referred to as stereotype. A 
15 stereotype is the profile of a typical user and is usually conditioned on the 
primary interest of the user, e.g. sports, music, movie, etc. The generic 
stereotype, however, is a mixture of the more specific stereotype profiles and 
describes an average interest in everything, e.g. potentially with some emphasis 
on "blue chips" like Hitchcock movies, BBC, documentaries, Wimbledon Finals, 
20 etc. In addition to the generic stereotype there might be a set of specific 
stereotype profiles, also respectively just referred to as specific stereotype, 
describing interests of a user who focuses on sports or In culture or on 
Hollywood movies, etc. Hie final user profile might be a linear combination of 
the generic stereotype, the specific stereotypes and a user specific profile, i.e. 
user profile, which is usually empty at the beginning and then has an initial 
weight of zero in the linear combination. 

When more user data comes in, this data is used to build up the user profile. 
However, a detailed user profile might need quite some usage data so that after 
only a few days or weeks of usage, there might be not yet be sufficient data 
available in many cases. Therefore, although the weight of the user profile is set 
to increase with increasing the amounts of data, most of the total weight of the 
linear combination of the profiles will come from the specific stereotype profiles, 
while the weight of the generic stereotype is set to decrease. The weights for 
each of the specific stereotypes are computed by matching the user data with 
the data in the respective stereotype and adding weight to the stereotype if a 
match is found. After much data is received, the weight of the generic stereotype 
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will have reached nearly zero, the weight of the specific stereotypes will be low, 
and thfe weight of the' user profile will be near to one. "The user profile in- this 
stage contains fine-grained information about the user's preferences, as 
expressed by his viewing behaviour. The user profile will be continued to be 
5 updated when more data comes in. 

This smooth transition from the generic stereotjrpe profile initially used for 
making suggestions to the user and the final user profile according to the 
present invention Is shown In Fig. 4a which exemplary shows several profiles 
10 and their respective weights after certain times have passed. The available 
profiles are a news profile, a sport profile, a thriller profile, a classical music 
profile, a generic/average user profile, i.e. the generic stereot3rpe, and a user 
specific profile, i.e. the user profile according to the present invention. The news 
profile has an initial associated weight O, i.e. at a start time t = O, a first weight 
15 0 after t « a few hours, a second weight 0.03 after t « 1 day, which steadily 
increases to a third weight 0*05 after t = 1 week, and which steadily decreases to 
a fotirth weight 0 after t = several months. The sport profile has an initial weight 
0 at a start time t = O, a first weight 0.03 after t = a few hours, a second wei^t 
0.09 after t = 1 day which steadily Increases to a third weight 0.1 after t « 1 
20 week, and which steadily decreases to a fourth weight O after t = several months. 
The thriller profile has an initial weight 0 at the start time t = 0, a first weight 
0.01 after t = a few hours, a second weight 0.01 after t = 1 day which steadily 
Increases to a third weight 0.05 after t = 1 week, and which steadily decreases to 
a fourth weight 0 after t = several months. The classical music profile has an 
25 initial weight 0 at a start time t = 0, a first weight 0.06 after t = a few hours, a 
second weight 0.07 after t = 1 day, which steadily increases to a third weight 0,2 
after t = 1 week, and which steadily decreases to a fourth weight O after t = 
several months. The generic stereotype has an initial weight 1.0 at a start time t 
= 0, a first weight 0.9 after t = a few hours, a second weight 0*7 after t = 1 day, 
30 which steadily decreases to a third weight 0.1 after t 1 week, and which 
further steadily decreases to a fourth weight 0 after t = several months. The user 
profile has an initial weight O at the start time t = O, a first weight O after t = a 
few hours, a second weight 0.1 after t = 1 day. which steadily increases to a 
third weight 0.5 after t = 1 week, and which further steadily increases to a 
35 fourth weight 1 after t « several months. 
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summarize and simplify, it is started with a generic stereotype profile 

containing word^welght -palrs-having-aH^e same low weight -X. The more^ the- 
user uses the system, the more word-weight pairs with weight larger than X will 
be contained in the proffles. Therefore, the overall relevance of the Initial generic 
5 word-weight pairs having weight X decreases over time. 

If the recommendation system does not receive the user data without a user 
Identification tag attached to it. i.e. if there is no user identification means 
build-in into the device or available to the device, which features the adaptive 

10 user profile, normally the user profile actuaUy contains a linear combination of 
many sub-user profiles, i.e. of aU use« using the system. According to the 
present Invention preferably these sub-user profiles are separated from the 
multi-user profile, i.e. fi-om the hnear combination of the many sub-user 
profiles. According to the present invention the separation and the selection of ■ 

15 the proper user profile is performed based on the user features, e.g. on side 
infonnation lil^e time of day or time of the week. The separation of a user profile 
mto two sub-profiles is preferably performed as described in the following. 

AS mentioned above, the user profile in itself is defined by a set of weighted 
20 keywords from the EPG data of the shows that the user has actively watched. In 
order to find out a proper split according to the user features, e.g. using a side 
mformatlon feature like time-of-day. a tentative split along an arbitrary time-off- 
day is performed and two sub-user profiles are tentatively generated. These are 
Checked for «.elr relative difference and tixe difference is stored. THcn, a next 
25 tentative split along anotixer tlme-of-day is performed and again tite difference is 
evaluated and stored, and so on. After all possible spUts (or a reasonable 
number of splits, have been trfed. the split yielding «.e highest difference 
between the two sub-user profiles is selected and the difference between the two 
sub-user profiles is analysed. If it exceeds a predefined tiireshold. a split of the 
30 multi-user profile is done, otherwise the multi-user profile remains unchanged 
since it seems to be related to one user only in respect to this user feature. 

The difference between two sub-user profiles can be computed by looking at the 
user profile as a discrete probability distribution over the keywords that are 
36 contained tisercin. After the tentative split, there wm be two such discrete 
probability dlstiibutions. H^e difference between ti.ese two can be computed 
ustag the symmetilzed Kullback-Leibler-distance sum (Pi • log(Pi/Pj), ^here 
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events which happen zero times should be replaced by one virtual occurrence. 
- both to avoid numerical problems and to avoid overstating the importance of 
Singleton events. 

5 Fig. 4b exemplary shows such a spUt. whereto after t > 6 months a spUt of the 
multi-user specific profile, e.g. that generated accordtag to Fig. 4a. is spUt into 
N sub-profiles. i.e. from a user profile 9 a first sub-user profile 10.. a second 
sub-user profile IO3. and so forth untU an N* sub-user profile 10^ is generated, 
-me generation of more than two sub-user profiles is performed by repeating the 

10 above-described procedures of spUtttng a user profile into two sub-user profiles 
on basis of each then generated sub-user profile. l.e splitting a sub-user profile 
into two new sub-user profiles, and so forth. 

In the following a third aspect of the present invention Is described according to 
15 which more appropriate suggestions for an individual user can be made on basis 
of user features to avoid too general suggestions that do not fit the current 
Situation. As an example, a certain user might like sports, thrillers and news, 
but when being together with friends, they also watch game shows. During tiie 
week tiie user usually only watches news and particular sports events, but no 
thrUlers and game shows. The user watches these only during Uie weekend. The 
user profile and/or history thus toclude all four topics. Getting always game 
shows and thrillers suggested from the system is a bit annoytag durtog the 
week, because the user never watches tiiese during the week. On the weekend 
however, it is very useful. 



Therefore, tiie present tovention provides a means to interactively infiuence the 
suggestions made by tiie system to specifically ask. e.g. for some suggestions for 
the weekend or for some game shows right now etc. According to tiie present 
invention the specification is performed according to an actual situation of the 
user represented on basis of the user features, e.g. by time of day. day of week 
(tonight, tomorrow night, weekend), genre, specific shows, mood of ttie user. 
hoUdays. guests, suggestions for family, for myself, evetytiiing until tomorrow 
night, etc. The mood of tiie user might be detected automatically from audio, i.e. 
speech, or vision, i.e. mimic camera, information or other biometric sensor 
35 attached directly to the body. 
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In general, as m«.tlon.^ ab»«, user history contains all the Information 
-ahout -lv. Shows -tte^er-has w.tched...e. ^iPo^ article.- ftlc-genre.- time. -dat.. 
dh-ector. artist, etc. The user profile contains weighted keywords which describe 
his vl.wh« toterests and «*teh are the basis for the act^U search In the EPO ,n 
order to make a specific suggestl^,. On. possible way to Implement the means 

1 t.T"'' f""" «« tte present taveatlon. h. 

the following referred to as mterscthre suggestion engtoc. Is to first filter the 
user history based on the specific actual sltuaUon. or request, from the «,er 
e.g. movle on the weekend, and to compute that m a specific user profile basd 
0 on the fllteicd hlstoiy. as It Is shown m «g. 5a. hi a first step sit ^ 
co^esponds to the first step SI as shown h. Pig. as. the user data U collect ^ 
^2 ™™<'' '° ">« -»nd sup S2 (not Shown 

L?fl„ *" ^' "'"""^ ^«-after. in a 

first fllt«^ step 14a filter settings are generated according to which the user 
. ^ Should be fiHcred. and hi a foUowmg second fl,te«ng step I4b these 

-tung. a« apphed to obtsh. thereafter m a step Sllb a spcclaU.ed user 
hU^. Based on this spedall^d user history a spedah^d user profile " 
derived h. a following step S12b and based thereon finally sLaLd 
suffiesOons are obtahied to a final step SlSb. specahzed 

Of co™-se. also the generated specie user profile can be combmed with the 

prl? 'Z,r"' """" """" ~ '""o'yPe 

profile or with special stereotype profiles by some weighting mechanism. 

Alternatively different user profiles might be created for the same user to respect 
to«.e viewing behaviour to dlffe^t situations. l.e. a spedahsed user profile 
m«h. be Obtained by the filtering. This situation Is shown to mg. 5b. which 

to z t"" " t: '"--'^^ - « as sho * 

in Fig. 3a^ user data Is coUected and to a following second step SIX. which 
corresponds to the second step S2 as shown to mg. 3a. the „L hi torT^ 
computed wher«rpon to a foUowtog step S12. which corresponds to the Ld 
and fourth steps S3 and 84 as shown to r^s. 3a and 3b. the fid. user pr^fiTet 

settings 

according to which the user profile should be filtered and the foUowtog second 
mterli^ st^ S14b to apply the filter setttogs are performed to obtato tfereafie^ 
to a step Slab a spedall^d user profile based on which to a final step S13b 
specialised suggestions are obtatoed. Ihls technique can also be used to ■ 
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differentiate between various profiles for different family members, as the one 
shown and -described in connection with Fig. -Sa. The Identity- of a*- user- could 
also be determined by spealser identification or other blometric techniques, as 
described above. 

5 

Another possibility is to compute all suggestions as usual, i.e. as shown and 
described in connection with Figs. 3a to 3c, using a user profile that contains 
everything, i.e. thriller, news, sports and game shows. The resulting set of 
suggestions is then filtered based on the current requirements of the user. This 

10 case is shown in Fig. 5c, in which first the user data is collected in a first step 
SIO, which corresponds to the first step SI shown In Fig. 3a, thereafter the user 
history is computed in a second step Sll, which corresponds to the second step 
S2 shown in Fig. 3a, based thereon the user profile is derived in a following step 
S12, which corresponds to the third and fourth steps S3 and S4 in Figs. 3a and 

15 3b, and based thereon the suggestions are obtained in a step S13, which 
corresponds to the fifth step S5 shown in Fig. 3c. The first filtexlng step S14a to 
set up the filter settings is only performed after the suggestions are obtained 
and the filter settings are set up in respect to the obtained suggestions. In the 
following second filtering step SI 4b the filter settings are applied to obtain in a 

20 final step SI 3b the specialized suggestions. 

In both latter cases the user profile Iceeps the information that the user watches 
thrillers on Saturday and Sunday only and news and sports during the week 
etc., and either the query is composed such that only the relevant factors are 
25 taken into account, as It is shown and described In connection with Fig. 5b, all 
the query is done as before using the complete profile and the results are then 
filtered later, as it is shown and described in connection with Fig. 5c. 

As mentioned above, the first possibility shown and described in connection with 
30 Fig. 5a is to use only the relevant parts from the history and to compute a new 
user profile therefrom. 

Of coiurse, the current requirements of the user can be explicitly entered by the 
user, e.g. "show me all thrillers for tonight" or they can also be learned 
35 automatically. 
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According to the present Invention, the suggestion engine can react 
appropriateUr-to-user- requests»-e.g_-any.. series" in contrast-to -^any -series- I^ni- 



used to watch-. 



5 Furthermore, the system learns If a user watches a specific show being broad- 
casted seldom (e. g. every 2 months) almost every time. In that case even if this 
show would be suggested based on the profile It would be suggested to the user 
based on above specific rules. 
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Claims 



1. Method to create a user profile that comprises a list of word-weight pairs 
or more general key structures, eharacterissed by the step of computing the 
weights based on user features that represent a lypical general behaviour of an 
individual user in respect to the application where the user profile is used. 

2. Method according to claim 1, cbaracterlsed by the step of evaluating an 
influence of the user features from a user history. 

3. Method according to claim 1 or 2. characterized by the step of dividing 
the user profile into sub user profiles according to characteristics of stereotjrpe 
user profiles. 

4. Method according to anyone of the preceding claims, characterized by 
the step of determining words to be included in the list of word-weight pairs on 
basis of words included in program descriptions of previous selections by the 
user. 

5. Method according to claim 4, characterized by the step of determining 
further words to be included in the list of word-weight pairs on basis of a 
database evaluation of co-occurrences of words already included in the list. 

6. Method according to claim 4 or 5, characterized by the step of computing 
a weight of a word to be included in the hst of word-weight pairs on basis of a 
comparison of an inverse document fi-equency of said word with respect to all 
available program descriptions and an Inverse document frequenqr of said word 
with respect to program descriptions of previous selections by the user. 

7. Method according to claim 4, 5 or 6, characterized by the step of 
computing weights of words to be included in the list of word-weight pairs as a 
product of 

• importance of the word with respect to all program descriptions of 
previous selections by the user, 

and at least one of the following terms: 

• a constant term, in particular one. 
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• importance of the word with respect to the program description in which 
- the word is included* - — - . . 

• importance of the word with respect to all available program descriptions, 
and 

• a correctional factor that depends on the length of the program 
description in which the word is included and the frequency of the word 
in this program description. 

8. Method according to claim 7» characterized in that the importance of a 
word with respect to all program descriptions of previous selections by the user 
is determined on basis of the portion of the word in the set of all words of the 
program descriptions of previous selections by the user. 

9. Method according to claim 8. characterized in that the importance of a 
word with respect to all program descriptions of previous selections by the user 
is determined under consideration of the program description in which the word 
is included. 

10. Method according to claim 7, 8 or 9, characterized in that the 

Importance of the word with respect to the program description in which the 
word is included is determined on basis of occurrences of the word in said 
program description. 

11. Method according to anyone of claims 7, to 10, characterized in that the 
Importance of the word with respect to all available program descriptions is 
determined on basis of its inverse document frequency in respect to all available 
program descriptions. 

12. Method according to claim 11, characterized in that the importance of a 
word with respect to all available program descriptions is determined under 
consideration of the program description in which the word is included. 

13. Method according to anyone of claims 7 to 12, characterized hy the step 
of computing weights of words to be included in the list of word-weight pairs 
according to the following formula: 

H>,(0=(l+^log(-i^-H)).a.log(^^)-( ), 
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wherein w,(t) Is the weight of a word t in a program description i. a is the 

importance.. of. ±he_word«t-_ln- the all. program, descriptions-of previous -selections— 

by the user, t is the word, f^. t is the number of occurrences of the word t in the 
program description i, fi is the document length of the program description 1, f^ is. 
5 the number of program descriptions containing the word t. and N is the number 
of program descriptions in the database. 

14. Method according to anyone of claims 7 to 13, characterized by the step 
of normalizing a weight of a word included in the list of word-weight pairs by a 

10 behaviour of the user to stay with a selection and to switch selections off or to 
switch on/off just for this selection. 

15. Method according to anyone of claims 7 to 14, characterized by the step 
of normalizing a weight of a word included in the list of word-weight pairs by a 

15 behaviour of the user when to actually consume a recorded selection. 

16. Method according to anyone of claims 7 to 15, characterized by the step 
of averaging the weights w,(t) of a word over all program descriptions of previous 
selections by the user in which the word occurs to a weight w(t) of the new 

20 profile. 

17. Method according to anyone of claims 7 to 16, characterized by the step 
of excluding words from the list of word-weight pairs which show a weight below 
a predetermined threshold or within a predetermined lower range of weights. 

25 

18. Method according to anyone of claims 6 to 17, characterized by the step 
of computing a weight of a word to be included in the list of word-weight pairs 
by considering negative selections of the user. 

30 19. Method according to anyone of claims 6 to 18« characterized by the step 
of computing a weight of a word to be included in the list of word-weight pairs 
by considering a maximum frequency of possible user selections in respect to 
certain user features. 

35 20. Method according to anyone of claims 4 to 19, characterized in that the 
created user profile is used as a query in possible future program descriptions to 
suggest at least one possible future selection to the user. - 
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21. Method according to claim 20, characterized in that for each possible 
future program description a modified OEAPI weight is computed, wherein each- 
matching word has its weight as a co*-factor so that the search result is 
influenced according to the user profile. 

S 

22. Method according to claim 21, characterized In that said modified OKAPI 
weight is computed according to the following formula: 

t^t^i ft fut^^fii^'^si4fd 

10 

wherein w(t) is the weight of a word t over all program descriptions of previous 
selections by the user, q is a queiy built according to the user profile, i are all 
program descriptions of possible future selections, w(t) is the weight of the word 
t, ^. t Is the number of occurrences of the word t in the program description i, ^ is 
15 the document length of the program description i, ft is the number of program 
descriptions containing the word t, and N is the number of program descriptions 
in the database. 

23. Method according to claim 21, characterized in that said modified OKAPI 
20 weight is computed according to the following formula: 

OKA,^iq,i)= X H'(0 (l + :^log(-j^+l)). 

wherein w(t) is the weight of a word t over all program descriptions of previous 
25 selections by the user, q is a query built according to the user profile, i are all 
program descriptions of possible future selections, w(t) is the weight of the word 
t, fj^ .t is the number of occurrences of the word t in the program description i, 
and f| is the document length of the program description i. 

30 24. Method according to anyone of claims 20 to 23, characterized in that the 
query is built by combining a stereotype user profile and the created user 
profile, wherein initially, when providing the suggestion for the first time, the 
stereot3rpe user profile is used alone, within a first predetermined period during 
the collection of data for computing the created user profile a linear combination 

35 of both user profiles is used, and after said first predetermined period during 
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the collection of data for computing the created user profile the created user 
— profile is used. - - . . ^ 

25. Method according to claim 24, characterized in that the stereotype user 
5 profile comprises a generic stereotype profile which describes an average interest 

m everything and at least one specific stereotype profile each defining interests 
of a user who focuses on a specific topic. 

26. Method according to claim 25, characterized in that a weight with which 
10 a specific stereotsrpe user profile Is considered during the linear combination to 

provide said suggestion is computed by matching the user selections with the 
data in the respective specific stereotype and increasing the weight of the 
stereotype if a match is found. 

15 27. Method according to claim 24, 25 or 26. characterized In that afl:er a 
second predetermined period the query is built by an individual user profile 
created according to anyone of claims 29 to 32. 

28. Method according to anyone of claims 20 to 27, characterized in that the 
20 query is built under consideration of a filtered user profile according to tilaim 

33. 

29. Method to create an Individual user profile from a multi-user profile that 
comprises a list of word-weight pairs» characterized by the step of at least once 

25 spUtting the multi-user profile according to user features that represent a 
typical general behavioiu* of an individual user in respect to the application 
where the user profile is used. 

30. Method according to claim 29, characterized in that for splitting the 
30 multi-user profile the following steps are performed: 

e) performing a tentative split according to user featinres to generate a first 
and a second sub user profiles, 

f) calculating the relative difference between the two sub user profiles, 

g) performing steps a) and b) until all or a predetermined number of 
35 tentative splits are performed, and 
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h) splitting the multi-user profile according to that tentative split that yields 

the highest relative diiference in case said relative difference lies above .a 

predetermined threshold. 

5 31. Method according to claim 30, characterized in that said relative 
difference is calculated by calculating a difference of a first discrete probabllily 
distribution of the first sub user profile over the user features that are contained 
therein and of a second discrete probability distribution of the second sub user 
profile over the user features that are contained therein. 

10 

32. Method accordii^ig to claim 31, characterized in that said difference of 
said two discrete probabiUty distributions is calculated using the $3mimetrized 
Kulback-Leibler-distance sum, where events which happen zero times are 
replaced by one virtual occurrence or where only events which happen at least 

15 once in both distributions are taken into account. 

33. Method to specii^ a suggestion for a next selection of a user, which 
suggestion is determined on basis of suggestion results which are computed of 
future program descriptions and a user profile, characterized by the step of 

20 filtering 

• a user history which is used to create the user profile, and/ or 

• the user profile, and/or 

• the suggestion results 

based on an actual situation of the user represented on basis of user features 
25 that represent a typical general behaviour of an Individual user in respect to the 
application where the user profile is used. 

34. Method according to anyone of the preceding claims, characterized in 
that said general key structure includes a forgetting factor. 

30 

35. Method according to anyone of the preceding claims, characterized in 
that a future program comprises a stored personal content. 

36. Method according to anyone of the preceding claims, characterized in 
35 that it is used in an audio/video program suggestion engine. 
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37. Method according to claim 36. characterized in that said audio/video 
pFogram suggestion engine Is-intemet based 

38. Method according to anyone of the preceding claims, characterized in 
5 that it is cUent based. 

39. Method accordtog to anyone of the preceding claims, characterized in 
that said user features comprise one or more of the foUowlng features: 

• preferred channel of audio/video program consumed by the user, 

• typical time to consume an audio/video program by the user. 

• length of consuming an audio/video program by the user in relation to 
the total length of the audio/video program. 

. tfane of beginning the consume of an audio/video program by the user in 

relation to a start time of the audio/video program, 
. typical length of consuming an audio video program by the user in 
relation to the time of consuming, 

• relation between how often a particular audio/video program is 
consumable and how often it is consumed by the user, 

. general audio/video program consumhig behaviour of the user in 
20 particular in relation to a switch-on time and length of a used 

audio/video device, 

• audio/video programs recorded by the user, 

. time duration between the recording of a particular audio/video program 
by the user and the consuming of said audio/video program by the user, 
25 • actual mood of the user, 

• actual wish of audio/video program entered by the user, 

• year of production of an audio/video program consumed by the user, 

. director and/or actor and/or group of actors of an audio/video program 
consumed by the user, 

• type of an audio/video program consumed by the user, and 

• title of an audio/video program consumed by the user. 

40. Computer program product, comprising computer program means adapted 
to perform the method steps as defined in anyone of the preceding claims when 
35 being executed on a computer, micro processor, digital signal processor, or the 
like including home server, set-top-box. TV. VCR, PDA. 
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41. Computer readable storage medium, storing thereon a computer program 
-product according to claim 40. 

42. Profiler to create a user profile that comprises a Ust of word-weight pairs. 
5 characterized by betog adapted to perform the method steps as defined In 

anyone of the preceding claims 1 to 39. 

43. Suggestion engine to specify a suggestion for a next selection of a user, 
which suggestion Is determined on basis of suggestion results which are 

10 computed of future program descriptions and a user profile, characterized by 
being adapted to perform the method steps as defined in anyone of the preceding 
claims 33 to 37. 



44. suggestion engine according to claim 43. characterized by a profiler 
15 according to claim 43. 
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Methods to croate a user profile and to specify a suggestion for a next 

selection of a user 

A user profile and/or the suggestions computed based thereon are obtained 
taking a special set of user features into account. The user features, are defined 
to represent a typical general behaviour of an individual user In respect to the 
application where the user profile is used. In other words, for each application 
where a user profile Is used a special set of user features are defined which are 
able to represent a typical general behaviour of an individual user. Based on 
these user features the weights in the list of word-weight pairs or weighted 
keywords which represents the user profile are computed or Influenced during 
the creation of the user profile, and/or a multi-user profile Is split during the 
creauon of an ^dividual user profile from a multi-user profile, and/or during 
specification of a suggestion a user history which Is used to create the user 
profile, and/or the user profile, and/or the suggestion insults are filtered. 
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